Inferring complex phylogenies using parsimony: an empirical approach using three large DNA data sets for angiosperms.

نویسندگان

  • D E Soltis
  • P S Soltis
  • M E Mort
  • M W Chase
  • V Savolainen
  • S B Hoot
  • C M Morton
چکیده

To explore the feasibility of parsimony analysis for large data sets, we conducted heuristic parsimony searches and bootstrap analyses on separate and combined DNA data sets for 190 angiosperms and three outgroups. Separate data sets of 18S rDNA (1,855 bp), rbcL (1,428 bp), and atpB (1,450 bp) sequences were combined into a single matrix 4,733 bp in length. Analyses of the combined data set show great improvements in computer run times compared to those of the separate data sets and of the data sets combined in pairs. Six searches of the 18S rDNA + rbcL + atpB data set were conducted; in all cases TBR branch swapping was completed, generally within a few days. In contrast, TBR branch swapping was not completed for any of the three separate data sets, or for the pairwise combined data sets. These results illustrate that it is possible to conduct a thorough search of tree space with large data sets, given sufficient signal. In this case, and probably most others, sufficient signal for a large number of taxa can only be obtained by combining data sets. The combined data sets also have higher internal support for clades than the separate data sets, and more clades receive bootstrap support of > or = 50% in the combined analysis than in analyses of the separate data sets. These data suggest that one solution to the computational and analytical dilemmas posed by large data sets is the addition of nucleotides, as well as taxa.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reconstructing phylogenies from allozyme data: comparing method performance with congruence

Allozyme data are widely used to infer the phylogenies of populations and closely-related species. Numerous parsimony, distance, and likelihood methods have been proposed for phylogenetic analysis of these data; the relative merits of these methods have been debated vigorously, but their accuracy has not been well explored. In this study, I compare the performance of 13 phylogenetic methods (si...

متن کامل

The Root of Flowering Plants and Total Evidence.

Support for Amborella as the sole survivor of an evolutionary lineage that is sister to all other angiosperms comes from positions in DNA multiple-sequence alignments that have a poor fit to time-reversible substitution models. These sites exhibit significant levels of homoplasy, compositional heterogeneity, and strong heterotachy. We report phylogenetic analyses with observed, randomized, and ...

متن کامل

Prospects for inferring very large phylogenies by using the neighbor-joining method.

Current efforts to reconstruct the tree of life and histories of multigene families demand the inference of phylogenies consisting of thousands of gene sequences. However, for such large data sets even a moderate exploration of the tree space needed to identify the optimal tree is virtually impossible. For these cases the neighbor-joining (NJ) method is frequently used because of its demonstrat...

متن کامل

Comparing Different Operators and Models to Improve a Multiobjective Artificial Bee Colony Algorithm for Inferring Phylogenies

Maximum parsimony and maximum likelihood approaches to phylogenetic reconstruction were proposed with the aim of describing the evolutionary history of species by using different optimality principles. These discrepant points of view can lead to situations where discordant topologies are inferred from a same dataset. In recent years, research efforts in Phylogenetics try to apply multiobjective...

متن کامل

Title: A weighted least-squares approach for inferring phylogenies from incomplete distance matrices Authors:

Motivation: The problem of phylogenetic inference from data sets including incomplete or uncertain entries is among the most relevant issues in systematic biology. In this paper, we propose a new method for reconstructing phylogenetic trees from partial distance matrices. The new method combines the usage of the four-point condition and the ultrametric inequality with a weighted least-squares a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Systematic biology

دوره 47 1  شماره 

صفحات  -

تاریخ انتشار 1998